03. Training & Memory


Recurrent Layers

The PyTorch documentation describes the main types of recurrent layers. Take a look and read about the three main types: RNN, LSTM, and GRU.

Hidden State Dimensions

Say you've defined a GRU layer with `input_size=100`, `hidden_size=20`, and `num_layers=1`.
What will the dimensions of the hidden state be if you pass in data with `batch_first=True`, in batches of 3 sequences at a time?

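SOLUTION: `(1, 3, 20)`

For a unidirectional layer, the hidden state has shape `(num_layers, batch_size, hidden_size)`. Setting `batch_first=True` only changes the layout of the input and output tensors; the hidden state keeps the layer dimension first.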
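
One way to check this answer is to build the layer and inspect the shapes directly. The following is a minimal sketch, not part of the original lesson: the sequence length of 5 is an assumption made for illustration (the quiz does not specify one), and the input is random data.

```python
import torch
from torch import nn

# Layer from the quiz: input_size=100, hidden_size=20, num_layers=1.
gru = nn.GRU(input_size=100, hidden_size=20, num_layers=1, batch_first=True)

# Assumed for illustration: each sequence has 5 time steps.
batch_size, seq_len = 3, 5

# With batch_first=True the input layout is (batch, seq, feature).
x = torch.randn(batch_size, seq_len, 100)

# No initial hidden state is passed, so PyTorch starts from zeros.
output, h_n = gru(x)

print(tuple(output.shape))  # (batch, seq, hidden_size)
print(tuple(h_n.shape))     # (num_layers, batch, hidden_size)
```

Note that the batch dimension comes first in `output` but second in `h_n`, which is the detail the quiz is testing.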